64 research outputs found

    beadarrayFilter : an R package to filter beads

    Get PDF
    Microarrays enable the expression levels of thousands of genes to be measured simultaneously. However, only a small fraction of these genes are expected to be expressed under different experimental conditions. Nowadays, filtering has been introduced as a step in the microarray preprocessing pipeline. Gene filtering aims at reducing the dimensionality of data by filtering redundant features prior to the actual statistical analysis. Previous filtering methods focus on the Affymetrix platform and can not be easily ported to the Illumina platform. As such, we developed a filtering method for Illumina bead arrays. We developed an R package, beadarrayFilter, to implement the latter method. In this paper, the main functions in the package are highlighted and using many examples, we illustrate how beadarrayFilter can be used to filter bead arrays

    A random effects model for the identification of differential splicing (REIDS) using exon and HTA arrays

    Get PDF
    Background: Alternative gene splicing is a common phenomenon in which a single gene gives rise to multiple transcript isoforms. The process is strictly guided and involves a multitude of proteins and regulatory complexes. Unfortunately, aberrant splicing events do occur which have been linked to genetic disorders, such as several types of cancer and neurodegenerative diseases (Fan et al., Theor Biol Med Model 3:19, 2006). Therefore, understanding the mechanism of alternative splicing and identifying the difference in splicing events between diseased and healthy tissue is crucial in biomedical research with the potential of applications in personalized medicine as well as in drug development. Results: We propose a linear mixed model, Random Effects for the Identification of Differential Splicing (REIDS), for the identification of alternative splicing events. Based on a set of scores, an exon score and an array score, a decision regarding alternative splicing can be made. The model enables the ability to distinguish a differential expressed gene from a differential spliced exon. The proposed model was applied to three case studies concerning both exon and HTA arrays. Conclusion: The REIDS model provides a work flow for the identification of alternative splicing events relying on the established linear mixed model. The model can be applied to different types of arrays

    High MIG (CXCL9) plasma levels favours response to peginterferon and ribavirin in HCV-infected patients regardless of DPP4 activity

    Get PDF
    Sustained virological response (SVR) following peginterferon (pegIFN) and ribavirin (RBV) treatment in hepatitis C virus (HCV) infected patients has been linked with the IL28B genotype and lower peripheral levels of the CXCR3-binding chemokine IP-10 (CXCL10). To further improve the understanding of these biomarkers we investigated plasma levels of the other CXCR3-binding chemokines and activity of the dipeptidyl peptidase IV (DPP4, CD26) protease, which cleaves IP-10, in relation to treatment response

    Using transcriptomics to guide lead optimization in drug discovery projects : lessons learned from the QSTAR project

    Get PDF
    The pharmaceutical industry is faced with steadily declining R&D efficiency which results in fewer drugs reaching the market despite increased investment. A major cause for this low efficiency is the failure of drug candidates in late-stage development owing to safety issues or previously undiscovered side-effects. We analyzed to what extent gene expression data can help to de-risk drug development in early phases by detecting the biological effects of compounds across disease areas, targets and scaffolds. For eight drug discovery projects within a global pharmaceutical company, gene expression data were informative and able to support go/no-go decisions. Our studies show that gene expression profiling can detect adverse effects of compounds, and is a valuable tool in early-stage drug discovery decision making

    ViVaMBC: estimating viral sequence variation in complex populations from illumina deep-sequencing data using model-based clustering

    Get PDF
    Background: Deep-sequencing allows for an in-depth characterization of sequence variation in complex populations. However, technology associated errors may impede a powerful assessment of low-frequency mutations. Fortunately, base calls are complemented with quality scores which are derived from a quadruplet of intensities, one channel for each nucleotide type for Illumina sequencing. The highest intensity of the four channels determines the base that is called. Mismatch bases can often be corrected by the second best base, i.e. the base with the second highest intensity in the quadruplet. A virus variant model-based clustering method, ViVaMBC, is presented that explores quality scores and second best base calls for identifying and quantifying viral variants. ViVaMBC is optimized to call variants at the codon level (nucleotide triplets) which enables immediate biological interpretation of the variants with respect to their antiviral drug responses. Results: Using mixtures of HCV plasmids we show that our method accurately estimates frequencies down to 0.5%. The estimates are unbiased when average coverages of 25,000 are reached. A comparison with the SNP-callers V-Phaser2, ShoRAH, and LoFreq shows that ViVaMBC has a superb sensitivity and specificity for variants with frequencies above 0.4%. Unlike the competitors, ViVaMBC reports a higher number of false-positive findings with frequencies below 0.4% which might partially originate from picking up artificial variants introduced by errors in the sample and library preparation step. Conclusions: ViVaMBC is the first method to call viral variants directly at the codon level. The strength of the approach lies in modeling the error probabilities based on the quality scores. Although the use of second best base calls appeared very promising in our data exploration phase, their utility was limited. They provided a slight increase in sensitivity, which however does not warrant the additional computational cost of running the offline base caller. Apparently a lot of information is already contained in the quality scores enabling the model based clustering procedure to adjust the majority of the sequencing errors. Overall the sensitivity of ViVaMBC is such that technical constraints like PCR errors start to form the bottleneck for low frequency variant detection

    FABIA: factor analysis for bicluster acquisition

    Get PDF
    Motivation: Biclustering of transcriptomic data groups genes and samples simultaneously. It is emerging as a standard tool for extracting knowledge from gene expression measurements. We propose a novel generative approach for biclustering called ‘FABIA: Factor Analysis for Bicluster Acquisition’. FABIA is based on a multiplicative model, which accounts for linear dependencies between gene expression and conditions, and also captures heavy-tailed distributions as observed in real-world transcriptomic data. The generative framework allows to utilize well-founded model selection methods and to apply Bayesian techniques

    Quantitation of Pretreatment Serum IP-10 Improves the Predictive Value of an IL28B Gene Polymorphism for Hepatitis C Treatment Response

    Get PDF
    Polymorphisms of IL28B gene are highly associated with sustained virological response (SVR) in patients with chronic hepatitis C treated with peginterferon and ribavirin. Quantitation of Interferon-γ Inducible Protein-10 (IP-10) may also differentiate antiviral response. We evaluated IP-10 levels in pretreatment serum from 115 non-responders and 157 sustained responders in the VIRAHEP-C cohort, including African Americans (AA) and Caucasian Americans (CA). Mean IP-10 was lower in sustained responders compared to non-responders (460 ± 37 pg/ml vs 697 ± 49 pg/ml, p600 pg/ml) was 67%. We assessed the combination of pretreatment IP-10 levels with IL28B genotype as predictors of treatment response. The IL28B polymorphism rs12979860 was tested in 210 participants. CC, CT, or TT genotypes were found in 30%, 49%, and 21%, respectively, with corresponding SVR rates of 87%, 50%, and 39% (p<0.0001). Serum IP-10 levels within the IL28B genotype groups provided additional information regarding the likelihood of SVR (p< 0.0001). CT carriers with low IP-10 had 64% SVR versus 24% with high IP-10. Similarly, a higher SVR rate was identified for TT and CC carriers with low versus high IP-10 (TT: 48% versus 20%, CC: 89% versus 79%). IL28B genotype and baseline IP-10 levels were additive but independent when predicting SVR in both AA and CA
    • 

    corecore